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METHOD AND APPARATUS FOR COPYING DATA 
THAT RESIDES IN A DATABASE 

FIELD OF THE INVENTION 
5 The present invention relates to the copying of data and more specifically to 

producing a copy of data that resides in a database. 

BACKGROUND OF THE INVENTION 

Planning software is used by manufacturers to aid in the manufacturing 

10 process. Based upon the desired product output and the components needed for each 
product, the planning software generates a schedule of what components need to be 
manufactured and by when, and what materials need to be procured and by when. 
This schedule is generated based upon data stored in a database. The planning 
process is often complicated and, for complex products, can take many hoxirs to 

15 complete. 

A typical requirement imposed by the planning software is that it needs to do 
its processing based on a single consistent version of the database. If one process of 
the planning software is reading one version of the database while another process is 
reading an updated version of the database, serious errors, such as double counting, 
20 can occur. As a result, planning software requires that data be provided ftom a 
single frozen version of the database in order to operate properly. 

As is well known, a database or a selected subset thereof can be frozen by 
obtaining exclusive locks on all of the tables in the database or the selective subset. 
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Once locked, the tables can be processed by the planning software to carry out the 
planning process. However, as noted above, the planning process can take many 
hours to complete. Many companies, especially those having offices around the 
world, cannot afford to lock their tables for extended periods of time. Hence, 

5 locking tables in this maimer is often not a viable solution. 

Another possible solution is to simply make a copy of the database prior to 
running the planning software. The problem with this solution is that for large 
databases, the copying process itself can take several hours. During this time, the 
tables need to be locked to ensure a frozen state. As long as the tables are locked, no 

10 updates can be made. Hence, this solution suffers from the same shortcomings, 
albeit to a lesser degree, as the locking solution. 

Based on the foregoing, it is clearly desirable to provide a mechanism for 
obtaining a single frozen version of the database, or a subset thereof, without locking 
tables in the database for an extended period of time. 

15 
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SUMMARY OF THE INVENTION 

According to one aspect of the invention, a method and apparatus for 
supplying a consistent set of data to a software application is provided. 

According to the method, a software application is launched that requires a 
5 particular set of data contained in a first database. Once the particular set of data is 
identified, a first process is requested to obtain a snapshot time from a database 
server associated with the first database. The snapshot time causes all subsequent 
reads of the first database by the first process to return data that reflects a database 
state associated with the snapshot time. After the first process obtains the snapshot 
10 time, the first process extracts the particular set of data from the first database. The 
software application is then supplied with the particular set of data that was extracted 
from the first database. 

In one embodiment, a second process is used to store the particular set of 
data that was extracted into a second database. 
1 5 According to another aspect of the invention, a method and apparatus for 

producing a copy of data from a first database is provided. 

According to the method, a first set of data in the first database is locked. 
After locking the first set of data, a plurality of processes are requested to obtain 
snapshot times from a database server associated vidth said first database. The 
20 snapshot times cause all subsequent reads of the first database by the plurality of 
processes to return data from the first database as of the snapshot times. After 
waiting a particular period of time for the plurality of processes to be assigned 
snapshot times, the locks on the first set of data in the first database are released. 
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The plurality of processes that were successful in obtaining a snapshot time within 
the particular period of time are used to extract a copy of the first set of data from 
the first database. The copy of the first set of data is then separately stored fi'om the 
first of data. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

The present invention is illustrated by way of example, and not by way of 
limitation, in the figures of the accompanying drawings and in which like reference 
numerals refer to similar elements and in which: 
5 Figure 1 is a block diagram of a system that may be programmed to 

implement the present invention; 

Figure 2 is a block diagram of a system that is used for producing a copy of a 
database in accordance with an embodiment of the present invention; 

Figure 2A illustrates the use of a snapshot time in accordance with an 
1 0 embodiment of the invention; 

Figure 2B illustrates the locking a database to obtain snapshot times that 
correspond to a single database state; 

Figure 3 A is a portion of a flow diagram illustrating a method for 
producing a copy of a database without retaining a lock on the database tables 
1 5 according to an embodiment of the invention; 

Figure 3B is another portion of a flow diagram illustrating a method for 
producing a copy of a database without retaining a lock on the database tables 
according to an embodiment of the invention; 

Figure 4 is a block diagram of a system that is used for producing a 
20 copy of a database in accordance with an embodiment of the present invention; 

Figure 5 A is a portion of a flow diagram illustrating a method for 
producing a copy of a database without locking the database tables; 



50277-177 (3018-166) 



Figure 5B is another portion of a flow diagram illustrating a method for 
producing a copy of a database without locking the database tables; and 

Figure 6 is a block diagram of alternate system for producing a copy of 
a database in accordance with an embodiment of the present invention. 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 

A method and apparatus for producing a consistent copy of a database, or 
portion thereof, is described. In the following description, for the purposes of 
explanation, numerous specific details are set forth in order to provide a thorough 
5 imderstandmg of the present invention. It will be apparent, however, to one skilled 
in the art that the present invention may be practiced without these specific details. 
In other instances, well-known structures and devices are shown in block diagram 
form in order to avoid unnecessarily obscuring the present invention. 

10 HARDWARE OVERVIEW 

Figure 1 is a block diagram that illustrates a computer system 100 upon which 
an embodiment of the invention may be implemented. Computer system 100 includes 
a bus 102 or other communication mechanism for communicating information, and a 
processor 104 coupled with bus 102 for processing information. Computer system 

15 1 00 also includes a main memory 1 06, such as a random access memory (RAM) or 
other dynamic storage device, coupled to bus 102 for storing information and 
instructions to be executed by processor 104. Main memory 106 also may be used for 
storing temporary variables or other intermediate information during execution of 
instructions to be executed by processor 104. Computer system 100 further includes a 

20 read only memory (ROM) 108 or other static storage device coupled to bus 102 for 
storing static information and instructions for processor 104. A storage device 110, 
such as a magnetic disk or optical disk, is provided and coupled to bus 102 for storing 
information and instructions. 
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Computer system 100 may be coupled via bus 102 to a display 112, such as a 
cathode ray tube (CRT), for displaying information to a computer user. An input 
device 114, including alphanumeric and other keys, is coupled to bus 102 for 
communicating information and command selections to processor 104. Another type 
of user input device is cursor control 116, such as a mouse, a trackball, or cursor 
direction keys for communicating direction information and command selections to 
processor 104 and for controlling cursor movement on display 1 12. This input device 
typically has two degrees of freedom in two axes, a first axis (e.g., x) and a second 
axis (e.g., y), that allows the device to specify positions in a plane. 

The invention is related to the use of computer system 100 for producing a 
copy of a database. According to one embodiment of the invention, a copy of a 
database is produced by computer system 100 in response to processor 104 
executing one or more sequences of one or more instructions contained in main 
memory 106. Such instructions may be read into main memory 106 from another 
computer-readable medium, such as storage device 110. Execution of the sequences 
of instructions contained in main memory 106 causes processor 104 to perform the 
process steps described herein. In altemative embodiments, hard-wired circuitry 
may be used in place of or in combination v^th software instructions to implement 
the invention. Thus, embodiments of the invention are not limited to any specific 
combination of hardware circuitry and software. 

The term "computer-readable medium" as used herein refers to any medium 
that participates in providing instructions to processor 104 for execution. Such a 
medium may take many forms, including but not limited to, non-volatile media. 
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volatile media, and transmission media. Non-volatile media includes, for example, 
optical or magnetic disks, such as storage device 110, Volatile media includes 
dynamic memory, such as main memory 106. Transmission media includes coaxial 
cables, copper wire and fiber optics, including the wires that comprise bus 102. 
5 Transmission media can also take the form of acoustic or light waves, such as those 
generated during radio-wave and infra-red data communications. 

Common forms of computer-readable media include, for example, a floppy 
disk, a flexible disk, hard disk, magnetic tape, or any other magnetic medium, a CD- 
ROM, any other optical medium, punchcards, papertape, any other physical medium 

1 0 with patterns of holes, a RAM, a PROM, and EPROM, a FLASH-EPROM, any 

other memory chip or cartridge, a carrier wave as described hereinafter, or any other 
medium from which a computer can read. 

Various forms of computer readable media may be involved in carrying one or 
more sequences of one or more instructions to processor 104 for execution. For 

1 5 example, the instructions may initially be carried on a magnetic disk of a remote 
computer. The remote computer can load the instructions into its dynamic memory 
and send the instructions over a telephone line using a modem. A modem local to 
computer system 100 can receive the data on the telephone line and use an infra-red 
transmitter to convert the data to an infra-red signal. An infra-red detector can receive 

20 the data carried in the infra-red signal and appropriate circuitry can place the data on 
bus 102. Bus 102 carries the data to main memory 106, from which processor 104 
retrieves and executes the instructions. The instructions received by main memory 
106 may optionally be stored on storage device 110 either before or after execution by 
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processor 104. 

Computer system 100 also includes a communication interface 118 coupled 
to bus 102. Communication interface 118 provides a two-way data communication 
coupling to a network link 120 that is connected to a local network 122. For 
5 example, communication interface 118 may be an integrated services digital network 
(ISDN) card or a modem to provide a data communication connection to a 
corresponding type of telephone line. As another example, conmiunication interface 
1 1 8 may be a local area network (LAN) card to provide a data communication 
connection to a compatible LAN. Wireless links may also be implemented. In any 

10 such implementation, communication interface 118 sends and receives electrical, 

electromagnetic or optical signals that carry digital data streams representing various 
types of information. 

Network link 120 typically provides data communication through one or 
more networks to other data devices. For example, network link 120 may provide a 

15 connection through local network 122 to a host computer 124 or to data equipment 
operated by an Internet Service Provider (ISP) 126. ISP 126 in turn provides data 
communication services through the world wide packet data communication 
network now commonly referred to as the "Internet" 128. Local network 122 and 
Internet 128 both use electrical, electromagnetic or optical signals that carry digital 

20 data streams. The signals through the various networks and the signals on network 
link 120 and through communication interface 118, which carry the digital data to 
and from computer system 100, are exemplary forms of carrier waves transporting 
the information. 
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Computer system 100 can send messages and receive data, including 
program code, through the network(s), network link 120 and communication 
interface 118. In the Internet example, a server 130 might transmit a requested code 
for an application program through Internet 128, ISP 126, local network 122 and 
5 communication interface 118. In accordance with the invention, one such 

downloaded application provides for producing a copy of a database as described 
herein. 

The received code may be executed by processor 104 as it is received, and/or 
stored in storage device 1 10, or other non-volatile storage for later execution. In this 
1 0 manner, computer system 1 00 may obtain application code in the form of a carrier 
wave. 

FUNCTIONAL OVERVIEW 
The present invention provides a mechanism for producing a consistent copy 
15 of a database, or portion thereof, without locking the database or desired portion for 
an extended period of time. For the purpose of explanation, embodiments of the 
invention shall be described with reference to a relational database that stores data in 
tables. The database portion of which a consistent copy is required is therefore 
referred to herein as the " desired tables" . However, the techniques described herein 
20 are not limited to tables or relational databases. 

In one embodiment, the foUovraig steps are performed to obtain a snapshot 
(consistent copy) of a database: 

(1) the desired tables (e.g. the tables that contain the information required by 
a memory-based planner) are identified; 
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(2) the desired tables are locked to prevent them from being updated; 

(3) a coordinator requests a plurality of snapshot worker processes to obtain 
snapshot times, where the snapshot times allow the snapshot workers to obtain data 
from the database as of a particular state of the database; 

5 (4) the coordinator then waits a certain period of tune for the requested 

plurality of snapshot workers to obtain snapshot times; 

(5) the coordinator then releases the locks on the desired tables; 

(6) using the snapshot workers that obtained a snapshot time before the locks 
were released (the " successfiil snapshot workers"), a copy of the desired tables is 

1 0 produced which represents the state of the database at the time desired tables were 
locked. 

SYSTEM OVERVIEW 
Figure 2 depicts a block diagram of a system 200 that is used for producing a 
15 copy of desired tables in accordance with an embodiment of the present invention. 

As depicted in Figure 2, a memory based planner 202 has been launched by a 
user to generate a planning schedule that is based upon a snapshot of database 204. 
Typically, generation of a planning schedule will only require data from a subset of 
the tables contained in database 204. Therefore, a copy table list 218 is maintained 
20 which lists the database tables (1 ..N) in database 204 that are required in the 
generation of the planning schedule (the "desired tables"). 

A coordinator 216 interfaces with individual worker processes, snapshot 
workers (206-214), loader workers (230-238) and delete workers(242-248), and is 
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responsible for the overall coordination of producing a copy of database 204 for use 
in generation of the planning schedule. 

OPERATION OF SYSTEM 200 
5 To produce a snapshot of database 204^ data copy table 218 is generated 

which identifies the desired set of data tables (1..N) in database 204. The desired set 
of tables listed in data copy table 21 8 are the tables that contain information that is 
needed by the memory based planner 202 in order to generate the planning schedule. 
Using the information contained in the data copy table 218, the coordinator 216 

1 0 locks each desired table (1 ..N). The locks thus obtained prevent other processes from 
modifying the desired tables (1..N). 

Once the locks are obtained, the coordinator 216 requests a plurality of 
snapshot workers (206, 208, 210, 212, 214 and 215) to obtain a snapshot time firom 
the database management system (DBMS). Snapshot times are assigned by the 

1 5 database management system, and are used by the database management system to 
determine what version of data to supply to processes. For example, the version of 
data items that are supplied to a process assigned a snapshot time of Tl will reflect 
the state of those data items as of time Tl, even if the data items have been 
subsequently modified. 

20 Snapshot times are used to guarantee that a particular version of data is given 

to a process (i.e. snapshot worker). Thus, if a process requests data that has been 
modified since obtaining its snapshot time, the DBMS reconstructs the state of data 
as of the snapshot time and supplies the reconstructed data to the process. 
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For example, Figxire 2A illustrates the use of a snapshot time in accordance 
with an embodiment of the mvention. In Figure 2A, DB_A, DB__B, DB_C, DB_D, 
DB_E and DB_F respectively represent different states of database 204 at time TO, 
Tl , T2, T3, T4 and T5. If a snapshot time is issued at time TO and a read is 
5 subsequently performed at time T3, the DBMS will respond to the read by returning 
data as it existed in the database as of time TO (i.e. DB_A). The DBMS may achieve 
this using a variety of mechanisms. According to one embodiment, the DBMS uses 
log entries to reconstruct the state of the requested data items to reflect their state at 
time TO. In effect, the snapshot time enables a process to see a snapshot of a 

1 0 database as of a particular time. 

One mechanism by which a DBMS can reconstruct a particular state of the 
database is described in detail in U.S Patent Applications "Method and Apparatus 
for Providing Isolation Levels in a Database", having Serial Number 08/613,026, 
filed March 11, 1996; "Techniques for Providing the Number of Snapshots of a 

15 Database", havmg Serial Number 08/838,967, filed April 23, 1997, and "Dynamic 
Snapshot Set Adjustment", having Serial Number 08/841,541, filed April 23, 1997, 
which are hereby incorporated by reference. 

After the coordinator 216 requests the plurality of snapshot workers (206, 
208, 210, 212, 214 and 215) to obtain snapshot times, it then waits for a certain 

20 period of time (T_WAIT) for the snapshot times to be assigned to the snapshot 

workers by the DBMS. The snapshot times that the DBMS assigns to the snapshot 
workers approximately reflect the times at which the snapshot time assignments are 
made. Since the snapshot times are being assigned after the desired tables are 
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locked, and before the locks on the desired tables are released, the snapshot times 
assigned to the snapshot workers will correspond to a time at which the desired 
tables were locked. 

In certain embodiments, the snapshot workers obtain snapshot times by 
5 executing a "set transaction read only" command. Once a process issues a "set 

transaction read only" command to the DBMS, that process is ensured by the DBMS 
that all subsequent reads by that process for that transaction will return data that was 
in the database as of the time the "set transaction read only" command was issued. 

In certain database systems, using the "set transaction read only" command 
1 0 has certain limitations. Specifically, the process that issued the command cannot 

perform any database modifications (defined as any inserts, updates, or deletes). If it 
does perform database modifications, it loses the benefits of the command (i.e. loses 
its snapshot time). Because of this, it is not possible to use the "set transaction read 
only" command with a transaction that makes copies within a database of the 
1 5 relevant tables (since copying would involve writing into a database). 

To circvimvent this problem, the successful snapshot workers read the data 
from the database, and then write the data to one or more flat files outside of the 
database. Because writing to a flat file is an operating system command, and not a 
DBMS command, it does not constitute a "write" operation as far as the DBMS is 
20 concerned. Hence, the snapshot times obtained through the "set transaction read 
only" command remains in effect for each snapshot worker. 

When the (T_WAIT) time period expires, the coordinator 216 releases its 
locks on the tables. At this time, some of the snapshot workers that were requested 
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to obtain snapshot times may not yet have been assigned snapshot times. To ensure 
that all processes that participate in the creation of copies of the relevant tables are 
seeing the same version of the relevant tables, only those snapshot workers that have 
been assigned snapshot times prior to the expiration of T_WAIT (the " successful 
5 snapshot workers" ) are used in retrieving data from database 204. For the purpose 
of explanation, it shall be assumed that snapshot workers 206-214 successfully 
obtained snapshot times, and that snapshot worker 215 did not. 

Because the tables identified in the copy table list 218 were locked at the 
time the successful snapshot workers were assigned their snapshot times, the 

1 0 successful snapshot workers are guaranteed that when they retrieve data from the 
desired table, they will retrieve data which corresponds to a single database state. 

For example, Figxore 2B illustrates locking database 204 to obtain snapshot 
times that correspond to a single database state in accordance with an embodiment of 
the invention. In Figure 23, DB_A, DB_B and DB_C represent different states of 

1 5 the relevant tables within database 204. In this example, the desired tables in 

database 204 are locked by coordinator 216 at time Tl for a T_WAIT period of "4", 
and, therefore cannot be updated by another process until T5. The coordinator 216 
maintains its locks on the desired tables in database 204 until T5. Thus, if snapshot 
workers 206, 208 and 210 obtain snapshot times at Tl, T3 and T4, they will all 

20 retrieve data which corresponds to a single database state (DB_B) from the desired 
tables. 

In one embodiment, the successful snapshot workers are not required to wait 
until the end of the T_WAIT period of time before they begin to retrieve data from 
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database 204. For example, referring to Figure 2B, if a snapshot worker obtains a 
snapshot time at T2, the snapshot worker may begin to retrieve data from the desired 
tables before the tables are unlocked at T5. 

Using the snapshot times, the successful snapshot workers (206, 208, 210, 

5 212 and 214) retrieve data from database 204 and copy it into a pluraUty of flat files 
(220, 22, 224, 226 and 228). The coordinator 216 is responsible for coordinating the 
particular data (i.e. the tables identified in data copy table 218) that is retrieved by 
each of the successful snapshot workers (206, 208, 210, 212 and 214). 

When any flat file of the plurality of flat files (220, 22, 224, 226 and 228) has 

10 been completed, one of aplurality of loader workers (230, 232, 234, 236 and 238) 
load the flat files into one of a plurality of snapshot tables (1 ..N) in snapshot table 
database 240. The plurality of snapshot tables (1 ..N) in snapshot table database 240 
are mapped on a one-to-one basis with the desired database tables (1 ..N) in database 
204, In certain embodiments, the loader workers are SQL loaders that write data 

1 5 from the flat files directly into the snapshot tables (1 ..N) in the snapshot table 
database 240. 

Before the data from a flat file thus created can be loaded into a particular 
snapshot table in snapshot table database 240, any data that was previously stored in 
the particular snapshot table (i.e. "stale data" from a prior execution of the planning 
20 software process 202) must first be removed. To perform this task, a plurality of 
delete workers (242, 244, 246 and 248) are used to delete previously stored snapshot 
tables from snapshot table database 240. 
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Once the previously stored snapshot tables are deleted and the flat files are 
loaded into the snapshot tables in the snapshot table database 240, snapshot table 
database 240 contains a consistent snapshot of the desired tables identified in data 
copy table 21 8 as of a particular point in time. 

In a preferred embodiment, once the desired tables are copied into the flat 
files, the memory based planner 202 reads the flat file information to generate a 
planning schedule. In an alternative embodiment, the memory based planner 202 
waits for the flat files to be copied into the snapshot tables in snapshot table database 
240 and then reads the snapshot table information to generate a planning schedule. 

SEQUENCE FOR PRODUCING A COPY OF A DATABASE 
Figure 3 A and 3B are flow diagrams illustrating a method for producing a 
copy of desired data from database 204 according to an embodiment of the 
invention. For explanation purposes, the flow diagrams of Figures 3 A and 3B are 
described with reference to the components of Figure 2. 

At step 302, a memory based planner 202 is executed by a user to generate a 
planning schedule that is based upon data firom database 204. At step 304, the 
memory based planner 202 causes a snapshot worker 206 to be launched. 

At step 306, the snapshot worker 206 determines which tables of database 
204 are required by the memory based planner 202 for generating the planning 
schedule. Using this information, the snapshot worker 206 generates a copy table list 
218. 
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In one embodiment, each entry in the copy table list 218 includes a table ID 
250, a delete start time 252 and a delete complete time 254. The table ID 250 
identifies a particular table in database 204 that is required by the memory based 
planner 202 for the generation of the planing schedule. The delete start time 252 

5 identifies a system timestamp in which a delete worker began to delete a 

corresponding snapshot table (to remove " stale data") in snapshot table database 
240, as identified by the particular table ID. For example, entry 256 of copy table 
list 218 indicates that a delete worker began to delete snapshot table 15 in snapshot 
table database 240 at the system time of 900. 

1 0 The delete complete time 254 identifies a system timestamp in which a delete 

worker completed the deletion of a snapshot table in snapshot table database 240, as 
identified by the particular table ID. For example, entry 256 of copy table list 218 
indicates that a delete worker completed the deletion of snapshot table 15 m 
snapshot table database 240 at the system time of 950. The delete start time 252 and 

1 5 delete complete time 254 entries are used by the coordinator 2 1 6 to determuie 
whether data can be loaded into a particular snapshot table (1..N). 

At step 308, the snapshot worker 206 spawns a coordinator process 216 for 
coordinating the copying of data from database 204. At step 309, the snapshot 
worker 206 notifies the coordinator 216 that the copy table list 218 has been created. 

20 At step 3 1 0, the coordinator 2 1 6 obtains locks on the tables identified m the copy 
table list 218. 

At step 312, after obtaining a lock on each table identified in copy table list 
218, the coordinator 216 requests a plurality of snapshot workers (206-214) to solicit 



50277-177 (3018-166) 



-20- 

snapshot times from the DBMS. In one embodiment, upon executing a memory 
based planner, the user may specify a particular number of snapshot workers for the 
coordinator 216 to request to solicit snapshot times. 

At step 314, the coordinator waits a certain period of time (T_WAIT) for the 

5 requested snapshot workers (206-214) to be assigned snapshot tunes. For example, 
in one embodiment, the coordinator 216 waits several minutes after requesting the 
plurality of snapshot workers (206-214) to solicit snapshot times from the DBMS. 

At step 316, after the (T_WAIT) period of time expires, the coordinator 
releases the locks on the tables identified in copy table list 218. 

10 At step 318, the coordinator 216 uses the copy table list 218 to assign the 

successfiil snapshot workers (in this example, snapshot workers 206-214), a 
particular set of data to copy from database 204. For example, snapshot worker 206 
may be assigned to copy database table " 1 7" from database 204. Once assigned a 
set of data, the successful snapshot workers (206-214) begin to copy the assigned 

1 5 data from database 204 into flat files (220-228). 

In certain embodiments, the coordinator 216 assigns the snapshot workers a 
particular set of data to copy before the (T_WAIT) period of time expires. For 
example, the coordinator could assign a particular copying task at the same time that 
the coordinator requests the snapshot workers to obtain snapshot times. Without 

20 waiting to the T_WAIT period to expire, each snapshot worker can begin the 
copying task as soon as (1) it is assigned the set of data copy, and (2) it has 
successfiiUy obtained a snapshot time. 
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At step 320, the coordinator launches delete workers (242-248) to delete 
"stale data" contained in the snapshot tables in snapshot table database 240. In 
deleting the "stale data", the delete workers use the copy table list 218 to identify 
which snapshot tables in snapshot table database 240 need to be deleted. In one 

5 embodiment, the delete workers (242-248) respectively update the delete start time 
252 and delete complete time 254 parameters for the corresponding entry as they 
begin and complete the deletion of a particular table identified in copy table list 218. 

For example, the delete worker 242 may use the information at entry 258 of 
copy table list 21 8 to identify a snapshot table " 7" in snapshot table database 240 as 

1 0 a table that needs to be deleted. The delete worker 242 then stores the current 

system time ("1000" in this example) in the delete start time 252 parameter of entry 
258 and begins deleting the snapshot table " 7" . When delete worker 242 completes 
the deletion of snapshot table " T , it stores the current system time (" 1130" in this 
example) in the delete complete time 254 parameter of entry 258. 

1 5 It should be noted that although the delete workers (242-248) are launched at 

step 320 in this example, the coordinator 216 may actually launch them prior to or 
even after step 320. For example, in one embodiment, the coordinator 216 launches 
the delete workers (242-248) immediately after it is notified that the copy table list 
21 8 has been created in step 308. 

20 At step 322, a snapshot worker notifies the coordinator 21 6 it has finished 

copying the data it was assigned. In one embodiment, the snapshot worker informs 
the coordinator 216 of the data that was copied (i.e. the particular database tables in 
database 204) and the location of the one or more flat files that contains the copy of 
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the data. Although Figure 2 depicts each snapshot worker using a single flat file for 
copying its assigned data, in certain embodiments, the snapshot workers may copy 
their assigned data to multiple flat files, informing the coordinator 216 each time one 
of the flat files has been completed. 

5 At step 324, the coordinator 2 1 6 determines whether the delete workers (242- 

248) have deleted the snapshot tables (i.e. "stale data") in the snapshot table database 
240 that correspond to the data that the snapshot worker has just completed copying. 
For example, if the snapshot worker informs the coordinator 216 that database tables 
"1", "5", and "8" in database 204 have just been copied to the flat file 222, the 

1 0 coordinator 2 1 6 determines whether the delete workers (242-248) have removed the 
"stale data" by deleting snapshot tables "1", "5" and "8" in snapshot table database 
240. 

In certain embodiments, the coordinator 216 uses the information in the copy 
table list 218 to determine if a particular snapshot table has been deleted. In one 

1 5 embodiment, the coordinator 216 uses the delete complete time 254 parameter to 
determine if a snapshot table has been deleted. 

If at step 324 the coordinator 216 determines that a snapshot table has not 
been deleted (i.e. snapshot tables "1", "5" and "8" for the previous example), then at 
step 326, the coordinator 216 delays the launching (e.g. spawning) of a loader 

20 worker (230-238) for loading the information until the snapshot tables " 1 ", "5" and 
"8" have been deleted. For example, assuming that snapshot worker 206 has notified 
coordinator 216 that it has copied database tables "1", "5" and "8" firom database 204 
into flat file 220, if coordinator 216 determines that snapshot table "5" in snapshot 
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table database 240 has not been deleted (i.e. the "stale data" has not been removed), 
then coordmator 216 will delay launching loader worker 230 for loading the 
information into snapshot tables "1", "5" and "8", until snapshot table "5" has been 
deleted. Control then proceeds to step 338. 
5 If at step 324, the coordinator 2 1 6 determines that the snapshot table has 

been deleted, then at step 328 the coordinator 216 launches a loader worker (230- 
238) to load the information from the flat file into the corresponding snapshot table 
in snapshot table database 240. 

At step 330, when a loader worker (230-238) finishes loading a flat file (220- 
1 0 228) into its corresponding snapshot file in snapshot table database 240, it notifies 
the coordinator 216. In certain embodiments, the loader workers (230-238) notify 
the coordinator 216 of the particular snapshot table in which the data was loaded. 

At step 332, the coordinator 216 determines whether all the desired tables 
identified in copy table list 218 have been copied fi:om database 204 into the flat 
15 files (220-228). 

If at step 332 the coordinator 216 determines that all the tables identified in 
copy table list 218 have been copied, then at step 334, the coordinator 216 notifies 
the memory based planner 202 that the flat files (220-228) contain a valid copy of 
tiie desired tables from database 204. At step 336, the memory based planner 202 
20 uses tiie information in the flat files (220-228) to generate the planning schedule. 

In an alternative embodunent, at step 332, the coordinator 216 determines 
whether all the desired tables identified in copy table list 218 have been copied from 
database 204 into the snapshot tables in snapshot table database 240. If the 
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coordinator 216 determines that all the tables identijfied in copy table list 218 have 
been copied, then at step 334, the coordinator 216 notifies the memory based planner 
202 that the snapshot table database 240 contains a valid copy of the desired tables 
from database 204. At step 336, the memory based planner 202 then uses the 
5 snapshot tables in snapshot table database 240 to generate the planning schedule. 

If at step 332 the coordinator 216 determines that all the tables identified in 
copy table list 218 have not been copied, then at step 338, the coordinator 216 
continues to accept and process other completion notifications from snapshot 
workers (206-214). When the coordinator 216 receives a notification that a snapshot 
10 worker (206-214) has finished copying its assigned data to a particular flat file, 

control proceeds to step 324 to determine whether the delete workers (242-248) have 
deleted the corresponding snapshot tables in the snapshot table database 240. 



PRODUCING A COPY WITHOUT LOCKING DATABASE TABLES 
1 5 In the embodiment described above, the desired tables are locked for a 

relatively brief period T_WAIT while snapshot workers obtain snapshot times. In 
an altemative embodiment, a mechanism is provided for producing a copy of desired 
tables ft'om a database without locking the desired tables. For example, in one 
embodiment, the following steps are performed to obtain a snapshot of a database 
20 without locking the desired tables: 

(1) the desired tables are identified; 

(2) a single snapshot worker process is requested to obtain a snapshot time; 
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(3) using the single snapshot worker^ a copy of desired tables is produced 
which reflects the state of the database associated with the snapshot time. 



Figure 4 depicts a block diagram of a system 400 that is used for producing a 
5 copy of a database in accordance with an embodiment of the present invention. 
Figure 4 is similar to Figure 2, and therefore like components have been numbered 
alike. 

The system 400 differs from that shown in Figure 2 in that system 400 
includes a single snapshot worker 206 which is used to retrieve data from database 
10 204. By using a single snapshot worker 206, only a single snapshot time is required. 
Because only a single snapshot time is required, the coordinator 216 is not required 
to lock the database tables in database 204. 

Figure 5 A and 5B are flow diagrams illustrating a method for producing a 
copy of desired data from database 204, without locking the database tables that 
1 5 contain the desired data, according to an embodiment of the invention. For 

explanation purposes, the flow diagrams of Figures 5 A and 5B are described with 
reference to the components of Figure 4. 

At step 502, a memory based planner 202 is executed by a user to generate a 
planning schedule that is based upon data from database 204. At step 504, the 
20 memory based planner 202 causes a snapshot worker 206 to be launched. 

At step 506, the snapshot worker 206 determines which tables of database 
204 are required by the memory based planner 202 for generating the planning 
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schedule. Using this information, the snapshot worker 206 generates a copy table 
list 218. 

In one embodiment, each entry in the copy table list 218 includes a table ID 
250, a delete start time 252 and a delete complete time 254. The table ID 250 

5 identifies a particular table in database 204 that is required by the memory based 
planner 202 for the generation of the planing schedule. The delete start time 252 
identifies a system time stamp in which a delete worker began to delete a 
corresponding snapshot table (to remove "stale data") in snapshot table database 
240, as identified by the particular table ID. For example, entry 256 of copy table 

1 0 list 2 1 8 indicates that a delete worker began to delete snapshot table 1 5 in snapshot 
table database 240 at the system time of 900. 

The delete complete time 254 identifies a system time stamp in which a 
delete worker completed the deletion of a snapshot table in snapshot table database 
240, as identified by the particular table ID. For example, entry 256 of copy table 

15 list 2 1 8 indicates that a delete worker completed the deletion of snapshot table 1 5 in 
snapshot table database 240 at the system time of 950. The delete start time 252 and 
delete complete time 254 entries are used by the coordinator 216 to determine 
whether data can be loaded into a particular snapshot table (1 .,N). 

At step 508, the snapshot worker 206 spawns a coordinator process 216 for 

20 coordinating the copying of data from database 204. At step 509, the snapshot 

worker 206 notifies the coordinator 216 that the copy table list 218 has been created. 

At step 510, the coordinator 216 requests the snapshot workers 206 to solicit 
a snapshot times from the DBMS. 
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At step 512, the coordinator waits for the snapshot worker 206 to be assigned 
a snapshot time. 

At step 514, the coordinator 216 uses the copy table Ust 218 to assign the 
snapshot worker 206, a particular set of data to copy from database 204. For 

5 example, snapshot worker 206 may be assigned to copy database table "17" from 
database 204. Once assigned a set of data, the snapshot worker 206 begins to copy 
the assigned data from database 204 into flat files (220-226). 

At step 516, the coordinator launches delete workers (242-248) to delete 
"stale data" contained in the snapshot tables in snapshot table database 240. In 

10 deleting the "stale data", the delete workers use the copy table list 218 to identify 
which snapshot tables in snapshot table database 240 need to be deleted. In one 
embodiment, the delete workers (242-248) respectively update the delete start time 
252 and delete complete time 254 parameters for the corresponding entry as they 
begin and complete the deletion of a particular table identified in copy table Hst 218. 

1 5 For example, the delete worker 242 may use the information at entry 258 of 

copy table list 21 8 to identify a snapshot table " T in snapshot table database 240 as 
a table that needs to be deleted. The delete worker 242 then stores the current 
system time ("1000" in this example) in the delete start time 252 parameter of entry 
258 and begins deleting the snapshot table " T . When delete worker 242 completes 

20 the deletion of snapshot table " T , it stores the current system time (" 1130" in this 
example) in the delete complete time 254 parameter of entry 258. 

It should be noted that although the delete workers (242-248) are launched at 
step 516 in this example, the coordinator 216 may actually launch them prior to or 
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even after step 516. For example, in one embodiment, the coordinator 216 launches 
the delete workers (242-248) immediately after it is notified that the copy table list 
218 has been created in step 508. 

At step 518, the snapshot worker 206 notifies the coordinator 216 it has 
finished copying the data it was assigned. In one embodiment, the snapshot worker 
informs the coordinator 216 of the data that was copied (i.e. the particular database 
tables in database 204) and the location of the one or more flat files that contains the 
copied data. As illustrated by flat files (220-226), snapshot worker 206 may copy its 
assigned data to multiple flat files, informing the coordinator 216 each time one of 
the flat files has been completed. 

At step 520, the coordinator 216 determines whether the delete workers (242- 
248) have deleted the snapshot tables (i.e. removed the "stale data") in the snapshot 
table database 240 that correspond to the data that the snapshot worker has just 
completed copying. For example, if the snapshot worker 206 informs the 
coordmator 216 that database tables "1", "5", and "8" in database 204 have just been 
respectively copied to the flat files 222, 224 and 226, the coordinator 216 determines 
whether the delete workers (242-248) have deleted snapshot tables "1", "5" and "8" 
in snapshot table database 240, thereby removing the "stale data". 

In certain embodiments, the coordinator 216 uses the information in the copy 
table list 21 8 to determine if a particular snapshot table has been deleted. In one 
embodiment, the coordmator 216 uses the delete complete time 254 parameter to 
determine if a snapshot table has been deleted. 
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If at step 520 the coordinator 216 determines that a snapshot table has not 
been deleted (i.e. snapshot tables "1", "5" and "8" for the previous example), then at 
step 522, the coordinator 216 delays the launching (e.g. spawning) of a loader 
worker (230-238) for loading the information until the snapshot tables "1", "5" and 
5 "8" have been deleted. For example, assuming that snapshot worker 206 has notified 
coordinator 216 that it has copied database tables "1", "5" and "8" fi:om database 204 
into flat file 220, if coordinator 216 determines that snapshot table "5" in snapshot 
table database 240 has not been deleted (i.e. the "stale data" has not been removed), 
then coordinator 216 will delay launching loader worker 230 for loading the 
10 information into snapshot tables "1", "5" and "8", until snapshot table "5" has been 
deleted. Control then proceeds to step 534. 

If at step 520, the coordinator 216 determines that the snapshot table has 
been deleted, then at step 524 the coordinator 216 launches a loader worker (230- 
238) to load the information firom the flat file into the corresponding snapshot table 
1 5 in snapshot table database 240. 

At step 526, when a loader worker (230-238) finishes loading a flat file (220- 
228) into its corresponding snapshot file in snapshot table database 240, it notifies 
the coordinator 216. In certain embodiments, the loader workers (230-238) notify 
the coordinator 216 of the particular snapshot table in which the data was loaded. 
20 At step 528, the coordinator 2 1 6 determines whether all the desired tables 

identified in copy table hst 218 have been copied from database 204 into the 
snapshot tables in snapshot table database 240. 
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If at step 528 the coordinator 216 determines that all the tables identified in 
copy table list 218 have been copied, then at step 530, the coordinator 216 notifies 
the memory based planner 202 that the snapshot table database 240 contains a valid 
copy of the desired tables fi-om database 204. At step 532, the memory based 

5 planner 202 uses the snapshot tables in snapshot table database 240 to generate the 
planning schedule. 

Conversely, if at step 528 the coordinator 216 determines that all the tables 
identified in copy table list 218 have not been copied, then at step 534, the 
coordinator 216 continues to accept and process other completion notifications fi-om 

1 0 snapshot workers (206-214). When the coordinator 216 receives a notification that a 
snapshot worker (206-214) has finished copying its assigned data to a particular flat 
file, control proceeds to step 520 to determine whether the delete workers (242-248) 
have deleted the corresponding snapshot tables in the snapshot table database 240. 

1 5 COPYING DATA DIRECTLY INTO ANOTHER DATABASE 

Certain database systems provide snapshot mechanisms for transactions that 
modify the database, as well as for "read only" transactions. Such a mechanism is 
described in U.S. Patent Application No. 08/613,026, filed March 11, 1996, Entitled 
"Method and Apparatus for Providing Isolation Levels in a Database" , the contents 

20 of which are incorporated herein by this reference. In certain embodiments, such a 
snapshot mechanism is used to allow the snapshot workers to copy the desired tables 
into a database without the use of flat files. 
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Figure 6 depicts a block diagram of a system 600 that is used for producing a 
copy of a database in accordance with an embodiment of the present invention. 
Figure 6 is similar to Figure 2, and therefore like components have been numbered 
alike. 

5 As illustrated in figure 6, snapshot workers (206-214) retrieve database table 

information from database 204 and copy it directly into the snapshot table database 
240. By writing directly to the snapshot table database 240, the intermediate steps of 
copying data to flat files and launching loader workers to copy that data from the flat 
files into the snapshot table database 240 can be eliminated. In one embodiment, the 

10 snapshot workers (206-214) copy the database tables (L.N) in database 204 directly 
into temporary files in snapshot table database 240. These temporary files are used 
by the memory based planner 202 for generating a planning schedule. 

In an alternative embodiment, the snapshot workers (206-214) copy the 
desired database tables (1..N) in database 204 directly into snapshot table database 

15 240 as "large binary objects" (BLOBs). In general, a BLOB is a data item that 
consists of a large amount of data and may be used by systems that require data 
types that are typically much larger than traditional data types. For example, a 
single BLOB may include four gigabytes of data and may be thought of as a file or a 
stream of characters or bytes. 

20 In certain embodiments, snapshot workers (206-214) retrieve database table 

information from database 204 and copy it as separate data back into database 204. 
By copying the database table information from database 204 as separate data in 
database 204, the intermediate steps of copying the data to flat files can be 
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eliminated. In one embodiment, the desired database tables (1..N) in database 204 
are directly copied as separate BLOBs in database 204. 

In the foregoing specification, the invention has been described with 
5 reference to specific embodiments thereof It will, however, be evident that various 
modifications and changes may be made thereto without departing from the broader 
spirit and scope of the invention. The specification and drawings are, accordingly, 
to be regarded in an illustrative rather than a restrictive sense. 



10 
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CLAIMS 

What is claimed is: 



1 1 . A method for supplying a consistent set of data to a software application, the 

2 method comprising the steps of: 

3 launching said software application; 

4 identifying a particular set of data that is required by the software 

5 application; 

6 requesting a first process to obtain a snapshot time from a database server 

7 associated with a first database, wherein the snapshot time causes all 

8 subsequent reads of said first database by the first process to retum 

9 data that reflects a database state associated with the snapshot time; 

10 after the first process obtains the snapshot time, causing the first process to 

1 1 extract the particular set of data firom the first database; and 

12 supplying said software appUcation with the particular set of data that was 

1 3 extracted fi"om the first database. 

1 2. The method of Claim 1 , fiarther comprising the step of causing a second 

2 process to store the particular set of data in a second database. 



1 3. The method of Claim 1, wherein the step of identifying the particular set of data 

2 includes the step of creating a copy table hst, wherein the copy table Ust contains 

3 entries that identify the particular set of data in the first database. 
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1 4. The method of Claim 2, wherein the step of causing the second process to 

2 store the particular set of data in the second database includes the steps of: 

3 writing the particular set of data to one or more flat files; and 

4 executing a loader process, wherein the loader process loads the particular set 

5 of data from the one or more flat files to the second database. 

1 5. The method of Claim 3, further comprising the steps of: 

2 executing a delete process, wherein the delete process uses the copy table hst 

3 to identify data that needs to be deleted in a second database; and 

4 deleting the identified data from the second database. 

1 6. The method of Claim 4, wherein the step of writing the particular set of data 

2 to one or more flat files includes the steps of: 

3 the first process informing a coordinator process when it has finished writing 

4 data to a particular flat file; and 

5 the coordinator using the information to tell the loader process when it can 

6 begin loading the flat file into the second database. 

1 7. The method of Claim 4, wherein the step of: 

2 writing the particular set of data to the flat file includes the step of writing 

3 the particular set of data to a plurality of flat files; and 

4 executing the loader process includes the step of executing a plurality of 

5 loader processes, wherein the plurality of loader processes load the 
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6 particular set of data from the plurality of flat files to the second 

7 database. 

18. The method of Claim 2, wherein the step of supplying said software 

2 apphcation with data from said particular set of data includes the steps of: 

3 said software apphcation reading the particular set of data stored in the 

4 second database; and 

5 said software apphcation generating a planning schedule based on the 

6 particular set of data. 

1 9. The method of Claim 3, where the step of creating the copy table hst 

2 includes the steps of: 

3 communicating with the software apphcation to identifying a set of planning 

4 data, where the planning data is required for generating a planning 

5 schedule; and 

6 creating the copy table list based on the identified set of planning data. 

1 10. The method of Claim 1, wherein the step of supplying said software 

2 apphcation with data from said particular set of data includes the steps of: 

3 writing the particular set of data to one or more flat files; and 

4 supplying the one or more flat files to said software application, wherein said 

5 software apphcation generates a planning schedule based on 

6 information contained in the one or more flat files . 
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1 11. A method for producing a copy of data from a first database, the method 

2 comprising the steps of: 

3 locking a first set of data in the first database; 

4 after locking the first set of data, 

5 requesting a plurality of processes to obtain snapshot times firom a 

6 database server associated with said first database, wherein 

7 the snapshot times cause all subsequent reads of the first 

8 database by the plurality of processes to return data fi*om the 

9 first database as of said snapshot times; 

10 waiting a particular period of time for the plurality of processes to be 

1 1 assigned snapshot times; 

12 releasing the locks on the first set of data in the first database; 

13 using a successfiil set of said plurality of processes to extract a copy 

14 of the first set of data firom the first database, wherein said 

1 5 successfizl set includes only those processes of the plurality of 

1 6 processes that were assigned a snapshot time within the 

1 7 particular period of time; and 

1 8 storing the copy of the first set of data separate fi*om said first of data. 

1 12. The method of Claim 1 1, wherein the step of identifying the first set of data 

2 includes the step of creating a copy table hst, wherein the copy table list 

3 contains entries that identify the first set of data in the first database. 
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1 13. The method of Claim 1 2, where the step of creating the copy table list 

2 includes the steps of: 

3 identifying a set of planning data, where the planning data is required to 

4 generate a planning schedule; and 

5 creating the copy table list based on the planning data required to generate 

6 the planning schedule. 

1 14. The method of Claim 11, wherein the step of storing the copy of the first set 

2 of data includes the steps of: 

3 writing the copy of the first set of data to a plurality of flat files; and 

4 executing a plurality of loader processes, wherein the plurality of loader 

5 processes load the copy of the first set of data from the plurality of 

6 flat files to a second database. 

1 15. The method of Claim 1 2, further comprising the steps of: 

2 executing a plurality of delete processes, wherein the plurality of delete 

3 processes use the copy table list to identify data that needs to be 

4 deleted in a second database; and 

5 deleting the identified data fi-om the second database. 
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1 16. The method of Claim 11, wherein the step of requesting the plurality of 

2 processes to obtain a snapshot time includes the step of requesting the 

3 pluraUty of processes based on a user input parameter, wherein the user input 

4 parameter identifies how many processes should be requested to obtain a 

5 snapshot time. 

1 17. The method of Claim 1 1 , wherein the step of extracting the copy of the first 

2 set of data from the database includes the steps of: 

3 assigning a set of copy data to the plurality of snapshot processes; and 

4 retrieving data from the first database based on the set of copy data that was 

5 assigned to the plurality of snapshot processes. 

1 1 8. The method of Claim 14, wherein: 

2 the steps of writing the copy of the first set of data to a pliurality of flat files 

3 fiirther includes the step of notifying a coordinator process that data 

4 has been written to one of the plurality of flat files; and 

5 the steps of executing the plurality of loader processes fiirther includes the 

6 step of the coordinator, upon being notified that data has been written 

7 to one of the plurality of flat files, launching a loader process to load 

8 the first set of data from one of the plurality of flat files to the second 

9 database. 
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1 19. The method of Claim 14, wherein the step of writing the copy of the first set 



2 of data to the plnraUty of flat files includes the steps of: 

3 the plurality of process informing a coordinator process when it has finished 

4 writing data to a particular flat file; and 

5 the coordinator using the information to tell one of the plurahty of loader 

6 processes when it can begin loading the particular flat file into the 

7 second database. 



1 20. The method of Claim 1 1 , wherein the step of storing the copy of the first set 

2 of data includes the steps of storing the copy of the first set of data as blob 

3 files that are separate fi-om said first of data. 

1 21. The method of Claim 1 1 , wherein the step of storing the copy of the first set 

2 of data includes the steps of storing the copy of the first set of data in said 

3 first of data. 
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1 22. A computer-readable medium carrying one or more sequences of one or more 

2 instructions for supplying a consistent set of data to a software application, 

3 the one or more sequences of one or more instructions including instructions 

4 which, when executed by one or more processors, cause the one or more 

5 processors to perform the steps of: 

6 launching said software application; 

7 identifying a particular set of data that is required by the software application; 

8 requesting a first process to obtain a snapshot time ft"om a database server associated 

9 with a first database, wherein the snapshot time causes all subsequent reads 

10 of said first database by the first process to return data that reflects a database 

1 1 state associated with the snapshot time; 

12 after the first process obtains the snapshot time, causing the first process to extract 

1 3 the particular set of data fi:om the first database; and 

14 supplying said software appUcation with the particular set of data that was extracted 

1 5 from the first database. 

1 23 . The computer-readable medium of Claim 22, wherein the computer-readable 

2 medium fiirther comprises instructions for performing the step of causing a 

3 second process to store the particular set of data in a second database. 



50277-177 (3018-166) 



-41- 



1 24. The computer-readable medium of Claim 22, wherein the step of 

2 identifying the particular set of data includes the step of creating a copy 

3 table list, wherein the copy table Hst contains entries that identify the 

4 particular set of data in the first database. 

1 25. The computer-readable medium of Claim 23, wherein the step of causing the 

2 second process to store the particular set of data in the second database 

3 includes the steps of: 

4 writing the particular set of data to one or more flat files; and 

5 executing a loader process, wherein the loader process loads the particular set 

6 of data from the one or more flat files to the second database. 

1 26. The computer-readable medium of Claim 25, wherein the step of writing the 

2 particular set of data to one or more flat files includes the steps of: 

3 the first process informing a coordinator process when it has finished writing 

4 data to a particular flat file; and 

5 the coordinator using the information to tell the loader process when it can 

6 begin loading the flat file into the second database. 
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1 27. A computer-readable medium carrying one or more sequences of one or more 

2 instructions for producing a copy of data from a first database, the one or 

3 more sequences of one or more instructions including instructions which, 

4 when executed by one or more processors, cause the one or more processors 

5 to perform the steps of: 

6 locking a first set of data in the first database; 

7 after locking the first set of data, 

8 requesting a plurality of processes to obtain snapshot times from a database server 

9 associated with said first database, wherein the snapshot times cause all 

1 0 subsequent reads of the first database by the plurality of processes to return 

1 1 data from the first database as of said snapshot times; 

12 waiting a particular period of time for the plurality of processes to be assigned 

13 snapshot times; 

14 releasing the locks on the first set of data in the first database; 

15 using a successfiil set of said plurality of processes to extract a copy of the first set of 

16 data from the first database, wherein said successfiil set includes only those 

1 7 processes of the plurality of processes that were assigned a snapshot time 

1 8 within the particular period of time; and 

1 9 storing the copy of the first set of data separate from said first of data. 
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1 28 . The computer-readable medium of Claim 27, wherein the step of identifying 

2 the first set of data includes the step of creating a copy table list, wherein the 

3 copy table list contains entries that identify the fnst set of data in the first 

4 database. 

1 29. The computer-readable medium of Claim 27, wherein the step of storing the 

2 copy of the first set of data includes the steps of: 

3 writing the copy of the first set of data to a plurality of flat files; and 

4 executing a plurality of loader processes, wherein the plurality of loader 

5 processes load the copy of the first set of data firom the plurality of 

6 flat files to a second database. 

1 30. The computer-readable medium of Claim 27, wherein the step of extracting 

2 the copy of the first set of data firom the database includes the steps of: 

3 assigning a set of copy data to the plurality of snapshot processes; and 

4 retrieving data firom tiie first database based on the set of copy data that was 

5 assigned to the plurality of snapshot processes. 
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1 31. The computer-readable medium of Claim 29, wherein the step of writing the 

2 copy of the first set of data to the pluraUty of flat files includes the steps of: 

3 the plurality of process informing a coordinator process when it has finished writmg 

4 data to a particular flat file; and 

5 the coordinator using the information to tell one of the plurality of loader processes 

6 when it can begin loading the particular flat file into the second database. 
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1 32, A computer system for supplying a consistent set of data to a software 

2 application, the computer system comprising: 

3 a memory; 

4 one or more processors coupled to the memory; and 

5 a set of computer instructions contained in the memory, the set of computer 

6 instructions including computer instructions which when executed by 

7 the one or more processors, cause the one or more processors to 

8 perform the steps of: 

9 launching said software appUcation; 

1 0 identifying a particular set of data that is required by the software 

1 1 appUcation; 

12 requesting a first process to obtain a snapshot time from a database 

1 3 server associated with a first database, wherein the snapshot 

14 time causes all subsequent reads of said first database by the 

1 5 first process to return data that reflects a database state 

1 6 associated with the snapshot time; 

17 after the first process obtains the snapshot time, causing the first 

1 8 process to extract the particular set of data from the first 

19 database; and 

20 supplying said software apphcation with the particular set of data that 

21 was extracted from the first database. 
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1 33 . The computer system of Claim 32, further including instructions for 

2 performing the step of causing a second process to store the particular set of 

3 data in a second database. 

1 34. The computer system of Claim 32, wherein the step of identifying the 

2 particular set of data includes the step of creating a copy table list, wherein 

3 the copy table list contains entries that identify the particular set of data in 

4 the first database. 

1 35. The computer system of Claim 3 3 , wherein the step of causing the second 

2 process to store the particular set of data in the second database includes the 

3 steps of: 

4 writing the particular set of data to one or more flat files; and 

5 executing a loader process, wherein the loader process loads the particular set 

6 of data fi-om the one or more flat files to the second database. 

1 36. The computer system of Claim 35, wherein the step of writing the particular 

2 set of data to one or more flat files includes the steps of: 

3 the first process informing a coordinator process when it has finished writing data to 

4 a particular flat file; and 

5 the coordinator using the information to tell the loader process when it can begin 

6 loading the flat file into the second database. 
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1 37, A computer system for producing a copy of data from a first database, the 

2 computer system comprising: 

3 a memory; 

4 one or more processors coupled to the memory; and 

5 a set of computer instructions contained in the memory, the set of computer 

6 instructions including computer instructions which when executed by 

7 the one or more processors, cause the one or more processors to 

8 perform the steps of: 

9 locking a first set of data in the first database; 

1 0 after locking the first set of data, 

1 1 requesting a plurality of processes to obtain snapshot times 

12 from a database server associated with said first 

1 3 database, wherein the snapshot times cause all 

1 4 subsequent reads of the first database by the plurality of 

1 5 processes to return data from the first database as of said 

16 snapshot times; 

1 7 waiting a particular period of time for the plurality of 

18 processes to be assigned snapshot times; 

19 releasing the locks on the first set of data in the first database; 

20 using a successful set of said plurality of processes to extract a 

21 copy of the first set of data from the first database, 

22 wherein said successful set includes only those 

23 processes of the plurality of processes that were 
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24 assigned a snapshot time within the particular period of 

25 time; and 

26 storing the copy of the first set of data separate from said first 

27 of data. 

1 38. The computer system of Claim 37, wherein the step of identifying the first set 

2 of data includes the step of creating a copy table hst, wherein the copy table 

3 list contains entries that identify the first set of data in the first database. 

1 39. The computer system of Claim 37, wherein the step of storing the copy of the 

2 first set of data includes the steps of: 

3 writing the copy of the first set of data to a plurality of flat files; and 

4 executing a plurality of loader processes, wherein the plurality of loader processes 

5 load the copy of the first set of data from the plurality of flat files to a second 

6 database. 

1 40. The computer system of Claim 37, wherein the step of extracting the copy of 

2 the first set of data from the database includes the steps of: 

3 assigning a set of copy data to the plurality of snapshot processes; and 

4 retrieving data from the first database based on the set of copy data that was 

5 assigned to the plurality of snapshot processes. 
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1 41 . The computer system of Claim 39, wherein the step of writing the copy of 

2 the first set of data to the plurality of flat files includes the steps of: 

3 the plurality of process informing a coordinator process when it has finished 

4 writing data to a particular flat file; and 

5 the coordinator using the information to tell one of the plurality of loader 

6 processes when it can begin loading the particular flat file into the 

7 second database. 
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ABSTRACT OF THE DISCLOSURE 

According to one aspect of the invention, a method and apparatus for 
supplying a consistent set of data to a software apphcation is provided. 

According to the method, a software application is launched that requires a 
5 particular set of data contained in a first database. Once the particular set of data is 
identified, a first process is requested to obtain a snapshot time from a database 
server associated with the first database. The snapshot time causes all subsequent 
reads of the first database by the first process to return data that reflects a database 
state associated with the snapshot time. After the first process obtains the snapshot 

10 time, the first process extracts the particular set of data from the first database. The 
software application is then suppKed with the particular set of data that was extracted 
from the first database. 

According to another aspect of the invention, a method and apparatus for 
producing a copy of data from a first database is provided. 

1 5 According to the method, a first set of data in the first database is locked* 

After locking the first set of data, a plurality of processes are requested to obtain 
snapshot times from a database server associated with said first database. The 
snapshot times cause all subsequent reads of the first database by the plurality of 
processes to return data from the first database as of the snapshot times. After 

20 waiting a particular period of time for the plurality of processes to be assigned 
snapshot times, the locks on the first set of data in the first database are released. 
The plurality of processes that were successfiil in obtaining a snapshot time within 
the particular period of time are used to extract a copy of the first set of data from 
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the first database. The copy of the first set of data is then separately stored from the 
25 first of data. 
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below nanned inventor, I hereby declare that: 

residence, post office address and citizenship are as stated below, next to my name. 

believe 1 am the original, first, and sole inventor (if only one name is listed below) or an original, first, and 
joint inventor (if plural names are listed below) of the subject matter which is claimed and for which a 
patent is sought on the invention entitled 

"METHOD AND APPARATUS FOR COPYING DATA THAT RESIDES IN A DATABASE" 

the specification of which 

X is attached hereto. 

X was filed on February 27, 1998 as 

United States Application Number 09/032,095. 

I hereby state that I have reviewed and understand the contents of the abovendentified specification, 
including the claims(s), as amended by any amendment referred to above. 

I acknowledge the duty to disclose all information known to me to be material to patentability as defined in 
Title 37, Code of Federal Regulations, Section 1 .56 (copy attached). 

I hereby claim foreign priority benefits under Title 35, United States Code, Section 1 19(a)-(d), on any 
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foreign application for patent or inventor's certificate having a filing date before that of the application on 
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the prior United States application in the manner provided by the first paragraph of Title 35, United States 
Code, Section 1 12, 1 acl<nowledge the duty to disclose all information known to me to be material to 
patentability as defined in Title 37, Code of Federal Regulations, Section 1 .56 (copy attached) which 
became available between the filing date of the prior application and the national or PCT International filing 
date of this application: 



(Application Number) 



(Filing Date) 



(Status - patented, pending, abandoned) 



(Application Number) 



(Filing Date) 



(Status - patented, pending, abandoned) 



1 hereby declare that all statements made herein of my own knowledge are true and that all statements made 
on information and belief are believed to be true; and further that these statements were made with the 
knowledge that willful false statements and the like so made are punishable by fine or imprisonment, or both, 
under Section 1001 of Title 18 of the United States Code and that such willful false statements may 
jeopardize the validity of the application or any patent issued thereon. 



Full Name of Sole/First Inventor (given name, family name) Nadeem Sved 
Inventor's Signature TL^ Date _ ' 
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(City, State) 
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Title 37, Code of Federal Regulations, Section 1 .56 
Duty to Disclose Information Material to Patentability 



(a) A patent by its very nature is affected with a public interest. The public interest is best served, 
and the most effective patent examination occurs when, at the time an application is being examined, the 
Office Is aware of and evaluates the teachings of all information material to patentability. Each individual 
associated with the filing and prosecution of a patent application has a duty of candor and good faith in 
dealing with the Office, which includes a duty to disclose to the Office all information known to that 
individual to be material to patentability as defined in this section. The duty to disclose information exists 
with respect to each pending claim until the claim is canceled or withdrawn from consideration, or the 
application becomes abandoned. Information material to the patentability of a claim that is canceled or 
withdrawn from consideration need not be submitted if the information is not material to the patentability of 
any claim remaining under consideration in the application. There is no duty to submit information which 
is not material to the patentability of any existing claim. The duty to disclose all information known to be 
material to patentability is deemed to be satisfied if all information known to be material to patentability of 
any claim issued in a patent was cited by the Office or submitted to the Office in the manner prescribed by 
§§ 1.97(b)-(d) and 1 .98. However, no patent will be granted on an application in connection with which 
fraud on the Office was practiced or attempted or the duty of disclosure was violated through bad faith or 
intentional misconduct. The Office encourages applicants to carefully examine: 

(1) Prior art cited in search reports of a foreign patent office in a counterpart application, and 

(2) The closest information over which individuals associated with the filing or prosecution of a 
patent application believe any pending claim patentably defines, to make sure that any material 
information contained therein is disclosed to the Office. 

(b) Under this section, information is material to patentability when it is not cumulative to information 
already of record or being made of record in the application, and 

(1 ) It establishes, by itself or in combination with other information, a prima facie case of 
unpatentability of a claim; or 

(2) It refutes, or is inconsistent with, a position the applicant takes in: 

(i) Opposing an argument of unpatentability relied on by the Office, or 

(ii) Asserting an argument of patentability. 

A prima facie case of unpatentability is established when the information compels a conclusion that a 
claim is unpatentable under the preponderance of evidence, burden-of-proof standard, giving each term in 
the claim its broadest reasonable construction consistent with the specification, and before any 
consideration is given to evidence which may be submitted in an attempt to establish a contrary 
conclusion of patentability. 

( c ) individuals associated with the filing or prosecution of a patent application within the meaning 
of this section are: 

(1) Each inventor named in the application; 

(2) Each attorney or agent who prepares or prosecutes the application; and 
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(3) Every other person who is substantively involved in the preparation or prosecution of the 
application and who is associated with the inventor, with the assignee or with anyone to whom there is an 
obligation to assign the application. 

(d) Individuals other than the attorney, agent or inventor may comply with this section by disclosing 
information to the attorney, agent, or inventor. 
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